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TETRACYCLINE REPRESSOR— MEDIATED BINARY 

REGULATION SYSTEM FOR CONTROL OF 
GENE EXPRESSION IN TRANSGENIC ANIMALS 



1- INTRODUCTION 
The present invention relates to a tetracycline 
repressor-mediated binary regulation system for the 
control of gene expression in transgenic animals. It 
is based, at least in part, on the discovery that, in 
a non-human transgenic animal that carries a first 
transgene under the control of a modified promoter 
comprising a tetR operator sequence and a second 
transgene encoding ttif tetR repressor protein, 
expression of the fir^t transgene may be efficiently 
induced by administering tetracycline to the animal. 

2. BACKGROUND OF THE INVENTION 

2,1. CONTROL OF GENE EXPRESSION 
IN TRANSGENIC ANIMALS 

The production of transgenic animals for both 
experiment and agricultural purposes is now well known 
(Wilmut et al w 7 July 1988, New Scientist pp. 56-59). 
In research, transgenic animals are a powerful tool 
that have made significant contributions to our 
understanding of many aspects of biology and have 
contributed to the development of animal models for 
human diseases (Jaenisch, 1988, Science 240 : 1468- 
1474). It is also clear that several livestock 
species can be made transgenic and these species 
promise to expand and revolutionize the method of 
production and diversity of pharmaceutical products 
available in the future, in addition to improving the 
agricultural qualities of the livestock species 
(Wilmut et al . , supra ) • 

A critical, often neglected, aspect of developing 
transgenic animals is the process whereby expression 
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of the newly introduced gene, referred to as the 
transgene, is controlled. This is an important 
process since stringent regulation of transgene 
expression is often important both for practical, 
regulatory and safety reasons and to maintain the 
health of the transgenic animal. In the past either 
"inducible" or "tissue specific" regulatory mechanisms 
have been used. Inducible regulation is defined 
herein as a method of gene regulation which allows for 
some form of outside manipulation of the onset and/ or 
level of transgene expression. Tissue specific 
regulation is def ined\ herein as a method for targeting 
transgene expression to particular tissues or organs. 



Inducible gene regulation may be achieved using 
relatively simple promoter systems such as the 
metallothionein heat shock promoters, or by using 
promoters which are responsive to specific compounds 
such as the Mouse mammary tumor virus LTR which is 
responsive to glucocorticoid stimulation. More 
flexible, though more complex inducible regulation 
systems can be achieved through a "binary" gene 
approach which utilizes a transact iva tor gene product 
to control expression of a second gene of interest. 
Tissue specific gene regulation usually consists of 
simple single gene methods (Byrne et al., 1989, Proc. 
Natl. Acad. Sci. U.S.A. 86:5473-5477; Ornitz et al. , ,r* 
1991, Proc. Natl. Acad. Sci. U.S.A. 88:698-702), 
although binary transactivator systems can also 
provide a high degree of tissue specificity. 

These current systems provide only a limited 
ability to control the time of transgene expression 
within individual animals. In this respect tissue 
specific promoter elements provide no method to 
control the onset of transgene activity, but function 
merely to target gene expression to defined sites. 
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Simple inducible promoters such as metallothionein 
generally lack tissue specificity and usually have 
some aspect of endogenous basal expression which 
5 cannot be controlled. Thus even for the extensively 
used inducible metallothionein promoter this approach 
at best only permits selection of the time at which a 
relative increase in transgene expression can be 
induced . 

10 Binary transact ivat ion systems typically consist 

of two transgenic animals. One animal contains the 
gene of interest controlled by a promoter element that 
requires a specific transactivator gene product for 
expression. Thus, th^ gene of interest is not 

15 expressed in the absence of the transactivator. A 

second transgenic animal is then made which expresses 
the required transactivator in the desired tissue. By 
mating these two transgenic animals, offspring 
containing both the gene of interest and the 

20 transactivator transgene can be produced. Only in 
these doubly transgenic animals is the gene of 
interest expressed. Since expression of the gene of 
interest requires the transactivator, this binary 
approach dramatically reduces or eliminates any 

25 undesirable basal expression inherent in simple 

inducible systems. Additionally, if expression of the 
transactivator is targeted using a tissue specific 
promoter, then in the double transgenics, expression 
of the gene of interest is in effect targeted to the 

30 same specific tissue. Binary systems provide 
therefore a low resolution method of temporal 
regulation in as much as they allow the determination 
of which generation of animals will express the gene 
of interest. These systems provide little ability, 

35 however, to control the time and level of gene 

expression within an individual transgenic animal. 
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For many applications it is necessary to 
accurately control the time and pattern of transgene 
expression within an individual transgenic animal. 
5 For example, many attempts have been made to produce 
transgenic pigs which express increased levels of 
growth hormone (Vize et al. , 1988, J, Cell Sci. 
£0:295-300;; Pinkert et al. , 1990, Dom. Animal 
Endocrinol. 7:1-18). Elevated growth hormone levels 
10 dramatically decrease the amount of body fat in pigs, 
and increase the animals overall feed efficiency. 
These effects would be beneficial, both to the 
consumer who could purchase a leaner, healthier 
product, and to the producer who can profit from 
15 having a more efficient animal. To date however, all 
attempts to increase the level of growth hormone 
through production of transgenic pigs have also 
produced serious pathological conditions which greatly 
reduce the health of the animals. These pathologies 
20 are the direct result of uncontrolled, constitutive 

expression of growth hormone, since many studies using 
exogenous hormone administration for short periods of 
time have not produced pathologies, while still 
benefiting feed efficiency and fat content. in this 
25 situation, a regulatory method to control onset and 
level of expression from a growth hormone transgene 
would be extremely useful. 

2.2. REPRESSOR— MEDIATED GENE CONTROL 
Transcriptional repressors are usually allosteric 
DNA binding proteins with at least two functional 
sites. One site on the protein is used to bind DNA. 
The DNA binding site binds to a defined DNA sequence 
which is known as the operator site. Operator sites 
usually consist of palindromic sequences of 12 or more 
base pairs. A gene which is regulated by a repressor 
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of the newly introduced gene, referred to as the 
transgene, is controlled. This is an important 
process since stringent regulation of transgene 
expression is often important both for practical, 
regulatory and safety reasons and to maintain the 
health of the transgenic animal, in the past either 
" inducible" or "tissue specific" regulatory mechanisms 
have been used. Inducible regulation is defined 
herein as a method of gene regulation which allows for 
some form of outside manipulation of the onset and/ or 
level of transgene expression. Tissue specific 
regulation is def ined* herein as a method for targeting 
transgene expression t^o particular tissues or organs. 

Inducible gene regulation may be achieved using 
relatively simple promoter systems such- as the 
metal lothionein heat shock promoters, or by using 
promoters which are responsive to specific compounds 
such as the Mouse mammary tumor virus LTR which is 
responsive to glucocorticoid stimulation. More 
flexible, though more complex inducible regulation 
systems can be achieved through a "binary" gene 
approach which utilizes a transact i vat or gene product 
to control expression of a second gene of interest. 
Tissue specific gene regulation usually consists of 
simple single gene methods (Byrne et al., 1989, Proc. 
Natl. Acad. Sci. U.S.A. 86:5473-5477; Ornitz et al. , 
1991, Proc. Natl. Acad. Sci. U.S.A. 88:698-702), 
although binary transacfcivator systems can also 
provide a high degree off tissue specificity. 

These current systems provide only a limited 
ability to control the time of transgene expression 
within individual animals. In this respect tissue 
specific promoter elememts provide no method to 
control the onset of transgene activity, but function 
merely to target gene eacpression to defined sites. 
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must have at least one operator site located within 
its promoter /regulatory region. A second site on the 
repressor protein binds a specific ligand, usually a 
small macromolecule such as an amino acid, sugar, or 
antibiotic. When the ligand is bound to the 
repressor, it causes a conformational shift such that 
the affinity of the repressor for the operator 
sequence is greatly reduced. For this reason, the 
ligand is frequently referred to as the "inducer", 
since it causes the repressor to disassociate from the 
operator, thereby eliminating the repressor's effect 
and allowing expression of the gene. 

Only the bacterial repressors Lacl, LexA and tetR 
have been shown to function in mammalian (Lacl and 
LexA) or plant (tetR) tissue culture cells. The first 
report of utilizing bacterial repressors in eukaryotes 
was from Brent and Ptashne who showed that LexA could 
function in yeast (1984, Nature 312:612-615). 
Subsequently, both LexA and Lacl have been shown to 
function in mammalian tissue culture systems (Smith et 
al., 1988, EMBO J. 7:3975-3981). Of these repressors 
Lacl has been most extensively studied. For Lacl 
repression, single or multipie operator sites have 
been positioned in three major locations: (i) between 
the transcription start site and the first codon of 
the mRNA; (ii> between the TATA-box sequence and the 
transcription start site; and (iii) between the TATA- 
box sequence and any more distal regulatory signal 
sequences. These studies reveal two predominant 
results. First, operators located in all three 
positions were effective in rendering the modified 
promoter subject to Lacl repression. Second, the 
presence of multiple operator sequences allowed 
greater levels of repression than did single operator 
insertions. From these studies it appears the Lacl 
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administering -tetracycline to a non-human transgenic 
animal that carries a first transgene, which is the 
gene of interest under the control of a promoter 
modified to comprise a tetR operator sequence and a 
second transgene encoding the tetR repressor protein. 

The present invention offers the advantage that, 
in the absence of tetracycline , expression of the gene 
of interest occurs at only very low levels due to 
efficient repression by tetR. In preferred, non- 
limiting embodiments of the invention, repression by 
tetR is further enhanced by utilizing a synthetic tetR 
gene which is devoid of splice signals and has 
optimized codon usage for mammalian cells. 
Accordingly, the present invention allows tight 
control of gene expression in transgenic animals by 
withholding or administering tetracycline. 



4 . DESCRIPTION OF THE FIGURES 
Figure 1. A. Nucleotide sequence of tetR operator 

as it occurs in TnlO, and in the oligonucleotides 
used to produce the modified PEPCK promoter 
elements. Bold face lettering represent the OP1 
and OP2 tetR binding sites. The general purpose 
oligonucleotide is the sequence from pdd7 . The 
flanking EcoRI and AccI restriction sites used to 
excise this operator sequence are indicated. 
Additional restriction sites present in the 
plasmid, but not indicated here, which can be 
used to excise the operator include PstI, BamHI, 
Spel, Sbal, NotI, EagI, SacII, BstXI , and SacI on 
the 5 • side and Xhol , Apal and Kpnl on the 3 • 
side. The sequence of the PEPCK-TATA box 
operator is also indicated (see methods) . 
Figure 1. B. Nucleotide sequence of the ddl 
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original tetR responsive CAMV promoter, in which the 
operator sites flank the TATA-box into transgenic 
tobacco plants. Unexpectedly, this promoter, which 
5 exhibited very good regulation in tissue culture 
assays was not very effective in regulating gene 
expression in transgenic plants . Instead they found 
that effective repression and induction in transgenic 
plants occurred when the operator sites were 
xo positioned just downstream of the normal transcription 
start site. 

3. SUMMARY OF THE INVENTION 
The present invention relates to a tetracycline 

15 repressor-mediated binary regulation system for the 
control of gene expression in non-human transgenic 
animals. It is based, at least in part, on the 
discovery that in transgenic mice carrying two 
transgenes, the first encoding bovine growth hormone 

20 (bGH) under the control of a PEPCK promoter modified 
to comprise the tetR operator sequence at the Nhel 
site, and the second encoding tetR repressor protein 
under the control of an unmodified PEPCK promoter, 
expression of bGH could be efficiently and selectively 

25 induced by administering tetracycline to the 
transgenic mice. 

In particular embodiments, the present invention 
provides for (i) animal promoter elements modified to 
comprise a tetR operator sequence; (ii) nucleic acid 

30 molecules comprising a gene of interest under the 

control of such a modified promoter; (iii) non-human 
transgenic animals that carry a transgene under the 
control of said modified promoter and/or a transgene 
encoding the tetR repressor protein; and (iv) a method 

35 of selectively inducing the expression of a gene of 
interest in a non-human transgenic animal comprising 
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start site of bGH from the Pck_N promoter. Total 
liver RNA (10/xg) was hybridized to a 280 bp 5* 
labelled probe from the Pck_NbGH gene in 40mM 
PIPES (Ph6.4), IMm EDTA, 400mM NaCl / 80% 
formamide at 55° overnight. The probe spanned 
from the Hinfl site in the 5 1 untranslated leader 
sequence of bGH to the PvuII site 5' of the TATA- 
17 box. The probe includes the tet-operator 
sequence of Pck_N (see Figure 3). After 
hybridization 3 00 pi of ice cold digestion buffer 
(280mM NacL, 50Mm SODIUM ACETATE (Ph4.5), 4 . 5Mm 
ZnS0 4 , 2 0/xg/ml carrier DNA and 500 units SI 
nuclease) was added and incubated at 37° for 30 
minutes. The reaction as stopped by adding 80^1 
of Stop Buffer (4M Ammonium acetate, 50mM EDTA 
and 50/xg/ml tRNA) , extracted with 
phenol/ chloroform, precipitated with ethanol and 
analyzed on a 6% sequencing gel. The arrow 
indicates the protected fragment. Initiation of 
bGH mRNA from the modified Pck_N promoter occurs 
approximately 20 bp 3 1 iof the TATA- box. This 
initiation site places Vthe start of the message 
just prior to the first' tetR binding site. This 
result indicates that the bGH mRNA starts from a 
single cap site, and suggests that tetR 
repression is due to a block in transcription 
initiation. Furthermore, unrepressed bGH 
expression appears to be due to limited tetR 
expr e s s i on • 

Figure 5. Nucleotide sequence of the tetR repressor 

protein gene. 
Figure 6. Alterative, nonlimiting promoters of 

interest. Asterisks indicate sites at which tetR 

operator sequence may be inserted. 
Figure 7. Northern blot analysis of bGH mRNA in liver 
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operator. Lower case letters correspond to 
polylinker sequence. The 5' EcoRl and 3« AccI 
restriction sites used for producing the modified 
5 PEPCK promoters (Pck_A and Pck-N) are indicated. 

The 10 base pair linker beween 0P1 and OP2 is 
underlined. Additional polylinker restriction 
sites available in pdd7 include PstI , BamHI, 
Spel, Xbal, NotI, EagI, SacII, BstXI,and SacI on 
xo th e 5 1 side and Xhol, Apal and Kpnl on the 3 1 

side. 

Figure 2. A representation of the three modified 

PEPCK promoter elements. Construct 251 has the 
337 operator sequence integrated in the AccI site 

15 of PEPCK , just 5» of the TATA- box control 

element. Construct 252 has the ddl operator 
sequence incorporated into the Nhel site of 
PEPCK, just 3 9 of the TATA-box element. 
Construct 261 incorporates the TATA-specif ic 

20 operator sequence which is integrated between the 

5» AccI site and the 3 1 Nhel sites. 
Figure 3. Structure of the ;modif ied PEPCK controlled 
bovine growth hormone denes. The Pck_AbGH and 
Pck_NbGH genes differ ohly in the site of 

25 operator insertion. For Pck_AbGH the operator is 

inserted at the AccI site 5 1 of the PEPCK TATA- 
box element. For Pck_NbGH the operator is 
inserted into the Nhel site 3 1 of the TATA-box 
element (pPCK_NbGH has been deposited with the 

30 ATCC and assigned accession No: ). In the 

Pck__TbGH gene, a TATA-box specific 
oligonucleotide was used, and this sequence was 
inserted between both the AccI and Nhel sites. 
A. Indicated the probe used for SI hybridization. 

35 Figure 4. SI Nuclease protection assay to map the 5 % 
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TZ-1 : 5 • CCGCATATGATCAATTCAAGGCCGAATAAG3 1 
TZ-3 : 5 1 CTTTAGCGACTTGATGCTCTTGATCTTCCA3 1 
l»Z-4 : 5 • AATTCGCC AG CC ATG CC AAAAAAG AAG AGG 3 • 
5 The TZ-4 primer is common to both primer pairs 

and is the 5 1 primer which encompasses the start 
codon of the tetR and mRNA. Primer TZ-1 and TZ-3 
are two different 3 1 primers both of which are in 
the tetR coding region. When amplified, these 

10 primer pairs produced smaller then expected 

products (approx. 215bp vs. 619bp for TZ-4 and 
TZ-1, and approx. 94bp vs. 498bp for TZ-4 and 
TZ-3) . The products of this reaction were cloned 
and sequenced. Sequencing revealed the presence 

15 of an unexpected intron which spanned from near 

the Xbal site at the start of tetR to a splice 
acceptor just 8 base pairs 5 1 of the TZ-3 primer. 
Figure 14. Composition analysis of Wild Type TnlO 
tetR gene. The TnlO tetR coding sequence was 

2 0 analyzed on a desktop computer using Mac Vector 

software. The figure shows a diagram of the tetR 
coding region with the .plus strand splice doner 
(D) and splice acceptor^ (A) signal sequences 
indicated. For reference the location of the 

25 Xbal restriction is also indicated. The first 

graph depicts the percentage of G and C bases in 
the coding region of tetR. There are several 
domains of very low GC content. The bottom graph 
is an analysis of codon bias. The dark line is a 

30 comparison of the tetR codon usage to a mouse 

codon bias table. Values lower than 1.0 are 
indicative of sequences which may translate 
poorly. For reference, a comparison of tetR to a 
Tobacco codon bias table is included (light 

35 line) . In transgenic tobacco, the tetR 

regulation system functions very efficiently. 



BNSDOCID: <WO 9404672A1 I > 



WO 94/04672 



- 11 - 



PCT/US93/08230 



of Fl generation animals. 
Figure 8* Northern blot analysis of bGH mRNA 
expression in four transgenic lines, 
5 Figure 9A. Tissue specificity of bGH expression in 
Line 10-2 in the presence of 50 Mg/ml 
tetracycline. Northern blot analysis of bGH 
induction in a variety of tissues . Only the 
liver and kidney show significant expression. 
10 Figure 9B. Tetracycline induction of bGH in Line 10-2. 

Both liver and kidney, which are the only sites 
for bGH expression in Figure 9A, also show 
tetracycline dependent bGH expression. 
Figure 10. 345 Repressor Construct. 
15 Figure 11. Induction of bGH expression in Construct 
345 Offspring. Northern blot analysis of liver 
RNA from Fl animals containing the 345 construct. 
Only animals from line 14 exhibit tetracycline 
dependent bGH expression. 
20 Figure 12. Expression and alternative processing of 
tetR transgene. A RNase protection probe which 
extends from the Nrul site of tetR 3 • to the end 
of the gene was used. This probe includes only 
tetR coding sequences arid should give a fully 
25 protected fragment of approximately 400 base 

pairs. A protected fragment of approximately 
220-260 base pairs is observed, which is far 
smaller then predicted. 
Figure 13. 5 1 Structure of tetR mRNA. Liver RNA was 
30 treated with reverse transcriptase and amplified 

by PCR. The RNA was amplified using two 
different pairs of primers. The first primer 
pair (T2-1 and TZ-4) should produce a 619 base 
pair product. The second primer pair (T203 and 
35 TZ04) should produce a 498 base pair product. 

The sequence of the primers are: 
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5. DETAILED DESCRIPTION OF THE INVENTION 
For purposes of clarity of description, and not 
by way of limitation, the detailed description of the 
5 invention is divided into the following subsections: 

(i) the tetR operator; 

(ii) modified promoters containing the tetR 
operator ; and 

(iii) utility of the invention. 

10 

5.1. THE TETR OPERATOR 
In order to practice the instant invention , the 
tetR operator sequence is inserted into a suitable 
animal promoter sequence in order to render that 
15 promoter subject to control by tetR repressor protein, 
A diagram of the tetR operator sequence is depicted in 
Figure 1. 

It may be convenient to clone the tetR operator 
into a vector, such as a plasmid or a phage, to 

20 facilitate its propagation. Cloned operator sequence 
may then be rendered available for insertion into a 
promoter of interest, as set forth in Section 5.2. , 
infra. \ 

In a particular, nonlimiting embodiment of the 

25 invention, tetR operator sequence may be cloned as 

follows: Four oligonucleotides, which when annealed 
produce the two 19bp 0P1 and OP2 palindromic sequences 
of the tetR operator may be synthesized; the sequences 
of said oligonucleotides are as follows: 

30 X- 1 . 5 1 ACTCTATCATTGATAGAGT3 • 
X-2 . 5 ■ ACTCTATCAATGATAGAGT3 » 
X-3 . 5 1 TCCCTATCAGTGATAGAGA3 1 
X-4 . 5 1 TCTCTATCACTGATAGGGA3 1 

Oligonucleotides X-l and X-2 are complementary and, 
35 when annealed, form the OP1 operator. Similarly, 

oligonucleotides X-3 and X-4, when annealed, produce 
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suggesting that for this gene, codon bias may be 
an important factor for efficient expression. 
Figure 15. Synthetic tetR Component Sequences. The 
5 components of the synthetic tetR gene were 

synthesized by Midland Laboratories as four 
overlapping double stranded DNA cassettes. The 
sequence of these cassettes are shown. Each 
cassette was blunt cloned into the Hinc2 site of 
10 pUC19 and sequenced to verify authenticity. The 

resulting plasmids pLTl, pLT2 , pLT3 and pLT5 can 
be used as the source material to assemble the 
entire synthetic tetR coding sequence since each 
contains an overlapping unique restriction site 
15 (bold face) through which they can be joined. 

Figure 16. Sequence of Synthetic tetR gene. 
Figure 17. Composition analysis of synthetic tetR. 
These graphs were produced using the same 
software described in Figure 15. The figure 
20 depicts the structure of the synthetic tetR gene, 

now devoid of splice donor signal sequences, with 
only a single splice acceptor signal remaining 
(A) . This is not the Splice acceptor which was 
active in the 345 construct. The percentage of G 
25 a nd C bases has been significantly improved, 

while the frequency of CpG base pairs has been 
kept to a minimum. A CpG base pair is frequently 
the site for DNA methylation, which can 
negatively effect the expression of a gene. The 
30 codon bias of the synthetic tetR gene is also 

vastly improved. The graph depicts the results 
when the synthetic tetR coding sequence is 
compared to the same mouse codon bias table used 
previously. 

35 
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For- insertion of the operator sequence, the PEPCK 
promoter may be cut with Nhel and end-filled with T4 
polymerase; tetR operator, prepared as set forth in 
section 5.1., supra , may then be blunt-ligated into 
place. 

5.3. UTILITY OF THE INVENTION 
5.3.1. STRATEGY 
The strategy of the invention is to prepare a 
non-human transgenic animal that comprises two 
transgenes. The first transgene, termed "A," is a 
gene of interest, the expression of which is desirably 
controlled. Virtually any gene of interest may be 
used, including, but not limited to, growth hormone, 
hemoglobin, low density lipoprotein receptor, insulin, 
genes set forth in Table I, etc. 

TABLE 1 
Other Genes Of Interest 
Gene Disease/ Af feet 

ADA Adenosine deaminase Immuno-def iciency 

TNF Tumor necrosis factor i Anti-cancer 
IL-2 Interleukin-2 ^ Anti-cancer 

LDL low density < hypercholesterolemia 

Factor IX hemophelia 
Factor VIII hemophelia 
/3-glucosidase Gauchers disease 

CFTR Cystic fibrosis Cystic fibrosis 

transmembrane regulator 
HPRT Hypoxanthine-guanine Lesch-Nyhan syndrome 

phosphor ibosy 1 transf erase 
UDP-glucuronyl transferase Crigler-Na j jar syndrome 
Growth Hormone receptor Growth 
Insulin-like growth factor Growth 
Growth hormone releasing Growth 

factor 
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the OP2 operator site. The OP1 oligonucleotides may 
then be directly cloned into the EcoRV site of the 
Bluescript (Stratagene) polylinker to form plasmid X. 
5 OP2 oligonucleotides may then be cloned into a Mung 
bean nuclease blunted Clal site of plasmid X to form 
plasmid Y. The resulting tetR operator may then be 
propagated and then excised from plasmid Y as an 
EcoRl, Accl fragment which may be end-filled with T4 
10 polymerase and gel purified. 

It is preferable that the separation between OP1 
and OP2 is about 10-11 bp. 

Analogous methods may be used to insert the tetR 
operator site into other suitable vectors. 

15 

5.2. MODIFIED PROMOTERS CONTAINING 
THE tetR OPERATOR 

According to the invention, the tetR operator may 
be inserted into a suitable animal promoter so as to 

20 render that promoter subject to repression by tetR 
repressor protein. Any animal promoter maybe used; 
strategies for promoter selection are set forth in 
Section 5.3. , infra. \ 

In preferred embodiments of the invention, the 

25 "tetR operator sequence is positioned 3 1 to the TATA- 
box sequence. A nonlimiting list of promoters which 
may be used according to the invention is set forth in 
Figure 6, together with the proximal portion of the 
promoter in the vicinity of the TATA-box, which is 

30 underlined. 

In a specific, nonlimiting embodiment of the 
invention, the tetR operator site may be inserted into 
the Nhel site of the PEPCK promoter (Wynshaw-Boris et 
al., 1984, J. Biol. Chem. 259:12161-12169). A diagram 

35 of the PEPCK promoter containing the tetR operator 
sequence of the Nhel site is presented in Figure 2. 
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9 , , and , as described in 

Section 7, infra . 

In further embodiments, the present invention 
provides for additional synthetic tetR genes from 
which one or more splice sites have been deleted or 
for which codon usage has been further optimized. 

The present invention covers synthetic tetR genes 
having the sequence set forth in Figure 16 and for 
functionally equivalent variants of that sequence. 

In specific, non-limiting embodiments of the 
invention, a nuclear localization signal may be added 
to a natural or synthetic tetR gene to facilitate its 
expression ( See , Section 7, infra ) . 

Expression of tetR is controlled by promoter "C". 
While it is preferable that promoter C be the same as 
promoter B except that promoter C does not contain a 
tetR operator sequence, any promoter which provides 
expression of tetR so as to repress expression of gene 
"A" during the period when it is desirable to repress 
expression of M A" may be used. 

For example, and not by way of limitation, a 
transgenic animal may be produced which carries a 
first transgene which is bovine growth hormone under 
the control of a PEPCK promoter modified to contain a 
tetR operator sequence at the Nhel site and a second 
transgene which is tetR repressor protein under the 
control of an unmodified PEPCK promoter; see Section 
6, infra . The pPCK_NbGH construct has been deposited 
with the ATCC and assigned accession number . 

5.3.2. TRANSGENIC ANIMALS OF 
THE INVENTION 

The binary repressor system of the invention may 
be used to control gene expression in any non-human 
transgenic animal, including, but not limited to, 
transgenic mice, pigs, goats, cows, rabbits, sheep, 
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The expression of gene "A" is under the 
transcriptional control of promoter "B" . Promoter B 
comprises a tetR operator sequence, as discussed 
5 supra . Promoter B desirably defines the time and 
tissue window in which the transgene may be induced; 
for example, promoter A may be a tissue specific 
promoter such as the PEPCK promoter (which is 
expressed selectively in liver and becomes active 

10 shortly prior to birth) . The second transgene encodes 
the tetR repressor, the sequence of which is set forth 
in Figure 5. 

Analysis of the TnlO tetR coding sequence 
indicates that the codon usage for this gene is poorly 
suited for expression in mammalian cells (FIG. 15) . 
To optimize tetR expression in mammalian cells a new 
tetR repressor gene was designed ( See , Section 7, 
infra) , which may be utilized in alternative 
embodiments of the invention. The synthetic tetR gene 

20 (syn-tetR) is designed to encode exactly the same 

protein product as the bacterial TnlO tetR gene but 
optimizes codon usage for mammalian cells. The 
percentage of G and C bases \has been significantly 
improved, while the frequency of CpG base pairs has 

25 been minimized. A CpG base pair is frequently the 
site for DNA methylation which can negatively affect 
the expression of a gene. In addition , the syn-tetR 
gene is devoid of any splice signals, decreasing the 
likelihood of aberrant splicing of the RNA which may 

30 result in production of a non-functional message. The 
sequence of the synthetic tetR gene is depicted in 
Figure 16. Plasmids comprising these sequences may be 
constructed using plasmids pLT-1, pLT-2, pLT-3 and 
pLT-5 (deposited with the American Type, Culture 

35 Collection (ATCC) and assigned accession numbers 
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for 12 to 14 days. On the last day of AT feeding all 
gilts may be given an intramuscular injection of 
prostaglandin (Lutalyse: lOmg/ injection) at 0800 
5 and 1600 hours* Twenty-four hours after the last day 
of AT consumption all donor gilts may be administered 
a single intramuscular injection of pregnant mare 
serum gonadotrophin (1500 U) • Human chorionic 
gonadotrophin (750 IU) may be administered to all 
10 donors at 80 hours after pregnant mare serum 
gonadotrophin . 

Following AT withdrawal , donor and recipient 
gilts may be checked twice daily for signs of estrus 
using a mature boar. Donors which exhibited estrus 
!5 within 36 hours following human chorionic 

gonadotrophin administration may be bred at 12 and 24 
hours after the onset of estrus using artificial and 
natural (respectively) insemination . 

Between 59 and 66 hours after the administration 
20 of HCG one- and two-cell ova may be surgically 
recovered from bred donors using the following 
procedure. General anesthesia may be induced by 
administering 0.5 mg of acepromazine/kg of bodyweight 
and 1.3 mg of ketamine/kg via a peripheral ear vein. 
25 Following anesthetization, the reproductive tract may 
be exteriorized following a mid-ventral laparotomy. A 
drawn glass cannula (O.D. 5 mm, length 8 cm) may be 
inserted into the ostium of the oviduct and anchored 
to the infundibulum using a single silk (2-0) suture. 
Ova may then be flushed in retrograde fashion by 
inserting a 20g needle into the lumen of the oviduct 2 
cm anterior to the uterotubal junction. Sterile 
Dulbecco^ phosphate buffered saline (PBS) 
supplemented with 0.4% bovine serum albumin (BSA) may 
be infused into the oviduct and flushed toward the 
glass cannula. The medium may be collected into 



30 



35 
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etc. The present invention provides for such non- 
human transgenic animals earring as transgenes nucleic 
acid constructs described herein, including natural or 
5 synthetic tetR repressor proteins and operator 
sequences • 

Transgenes may be introduced by microinjection, 
transf ection, transduction, electroporation, cell gun, 
embryonic stem cell fusion, or any other method known 
i0 in the art. The transgenes of the invention may be 
co-introduced into a single animal or may be 
introduced into two individual animals that are 
subsequently mated to produce doubly transgenic 
offspring. 

X5 For example, for the production of transgenic 

mice, the following general protocol may be used. 
Male and female mice are mated at midnight* Twelve 
hours later, the female may be sacrificed and the 
fertilized eggs may be removed from the uterine tubes. 

20 Foreign DNA may then be microinjected (100-1000 
molecules per egg) into a pronucleus. Shortly 
thereafter, fusion of the pronuclei (a pronucleus or 
the male pronucleus) occurs ,\ and, in some cases, 
foreign DNA inserts into (usiially) one chromosome of 

25 the fertilized egg or zygote. The zygote may then be 
implanted into a pseudo-pregnant female mouse 
(previously mated with a vasectomized male) where the 
embryo develops for the full gestation period of 20-21 
days. The surrogate mother then delivers the mice and 

30 four weeks transgenic pups may be weaned from the 

mother . 

According to another embodiment of the invention, 
a transgenic pig may be produced, briefly, as follows. 
Estrus may be synchronized in sexually mature gilts 
35 (>7 months of age) by feeding an orally active 

progestogen (e.g. allyl trenbolone, AT: 15mg/gilt/day) 
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mis of BMOC-3 medium may be aspirated into the tubing. 
The tubing may then be fed through the ostium of the 
oviduct until the tip reaches the lower third or 
5 isthmus of the oviduct. The ova may be subsequently 
expelled as the tubing is slowly withdrawn. The 
exposed portion of the reproductive tract may be 
bathed in a sterile 10% glycerol - 0.9% saline 
solution and returned to the body cavity. The 

10 connective tissue encompassing the linea alba, the 
fat, and the skin may be sutured as three separate 
layers. An uninterrupted Halstead stitch may be used 
to close the linea alba. The fat and skin may be 
closed using a simple continuous and mattress stitch, 

15 respectively. A topical antibacterial agent (e.g. 

Furazolidone) may then be administered to the incision 
area. Recipients may be penned in groups of about 
four and fed 1.8 kg of a standard 16% crude protein 
corn-soybean pelleted ration. Beginning on day 18 

20 ( da Y 0 = onset of estrus) , all recipients may be 

checked daily for signs of estrus using a mature boar. 
On day 35, pregnancy detection may be performed using 
ultrasound. On day 107 of gestation recipients may be 
transferred to the farrowing suite. In order to 

25 ensure attendance at farrowing time, farrowing may be 
induced by the administration of prostaglandin (10 
mg/injection) at 0800 and 1400 hours on day 112 of 
gestation. In all cases, recipients may be expected 
to farrow with 34 hours following PGF 2a 

30 administration. 

As used herein, the term "transgenic animal" 
refers to animals that carry a transgene in at least 
some of their somatic cells, and preferably in at 
least some of their germ cells. 

35 
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sterile 17 x 100 mm polystyrene tubes. Flushings may 
be transferred to 10 x 60 mm petri dishes and searched 
at a lower power (50x) using a Wild M3 
5 stereomicroscope . All one- and two- cell ova may be 
washed twice in Brinster's Modified Ova Culture -3 
medium (BMOC -3) supplemented with 1.5% BSA and 
transferred to 50 fil drops of BMOC-3 medium under oil. 
Ova may be stored at 38 °C under a 90% N z , 5% O z , 5% Co 2 

10 atmosphere until microinjection is performed. One and 
two-cell ova may be placed in an Eppendorf tube (15 
ova per tube) containing 1 ml HEPES medium 
supplemented wit 1.5% BSA and centrifuged for 6 
minutes at I4,000g in order to visualize pronuclei in 

15 one-cell and nuclei in two-cell ova. Ova may then be 
transferred to a 5-10m1 drop of HEPES medium under oil 
on a depression slide. Microinjection may be 
performed using a Laborlux microscope with Nomarski 
optics and two Leitz micromanipulators. 10-1700 

20 molecules of construct DNA (linearized at a 

concentration of about lng//il of Tris-EDTA buffer) may 
be injected into one pronucleus in one-cell ova or 
both nuclei in two-cell ova\ Microin jected ova may be 
returned to microdrops of BMOC-3 medium under oil and 

25 maintained at 38 °C under a 90% N 2 , 5% C0 2f 5% 0 2 
atmosphere prior to their transfer to suitable 
recipients. Ova may preferably be transferred within 
10 hours of recovery. Only recipients which exhibit 
estrus on the same day or 24 hours later than the 

30 donors may preferably be utilized for embryo transfer. 
Recipients may be anesthetized as described supra . 
Tol lowing exteriorization of one oviduct, at least 30 
injected one- and/ or two-cell ova and 4-6 control ova 
may be transferred in the following manner. The 

3 5 tubing from a 21g x 3/4 butterfly infusion set may be 
connected to a Ice syringe. The ova and one to two 



tetracycline, as inducer, is between about 5-50 mg/kg 
and preferably between about 5-15 mg/kg* 

6. EXAMPLE: TETRACYCLINE REPRESSOR— MEDIATED 
BINARY REGULATION SYSTEM FOR CONTROL OF 
BOVINE GROWTH HORMONE EXPRESSION IN 
TRANSGENIC MICE 

6.1. MATERIALS AND METHODS 

6.1.1. CONSTRUCTION OF PLASMIDS 

Plasmid pdd7 contains a functional tetR operator 

site cloned within a Bluescript (Stratagene) 

polylinker. This plasmid is useful for propagating 

the operator sequence, and as a source of operator 

sites for insertion into the PEPCK promoter or any 

other promoter element. The pdd7 plasmid was made as 

follows. Four oligonucleotides, which when annealed 

produce the two 19bp OP1 and OP2 palindromic sequences 

of the tetR operator were synthesized. The sequences 

of each oligonucleotide is listed below. 

X-1.5 1 ACTCTATCATTGATAGAGT 3' 

X-2 . 5 9 ACTCTATCAATGATAGAGT 3 » 

X-3.5 1 TCCCTATCAGTGATAGAGA 3' 

X-4.5 1 TCTCTATCACTGATAGGGA 3 1 
Oligonucleotides X-l and X-2 ' are complementary and 
when annealed form the OP1 operator. Similarly 
oligonucleotides X-3 and X-4 produce the OP2 operator 
site. The OPl oligonucleotides were directly cloned 
into the EcoRV site of the Bluescript polylinker. The 
resulting plasmid pSOPI was sequenced to verify the 
integrity of the insert. OP2 oligonucleotides were 
subsequently cloned into a Mung bean nuclease blunted 
Clal site of pSOPI to produce pdd7 - Due to a cloning 
artifact produced by the Mung bean nuclease, the 
operator in pdd7 consists of the two 19bp OPl and OP2 
sequences separated by linker of only 10 base pairs. 
This difference does not effect tetR binding. The 



\ 
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5*3.3. INDUCTION 
Induction of expression of the gene of interest 
in transgenic animals of the invention may be achieved 
5 by administering , to the animal, a compound that binds 
to tetR so that tetR repressor function is inhibited. 
Examples of such compounds include tetracycline and 
tetracycline-like compounds, including, but not 
limited to, apicycline, chlortetracycline, 
10 clomocycline, demeclocyline, guamecycline, lymecycline, 
meclocycline , methacycline , minocycline , 
oxy tetracycline , penimepicycline , pipacycline , 
rolitetracycline, sancycline, and senociclin. 

Administration of the inducer can be through 
15 direct injection, water, feed, aerosol, or topical 

application. The choice of method will depend on the 
promoters used and the specific application of the 
transgenic animals. For example, injection, water and 
feed would provide inducer to all of the animals 
20 tissues. In our case, administration through water or 
feed would be the preferred method to control growth 
hormone expression in transgenic pigs. Aerosol spray 
could be used to attain high antibiotic concentrations 
in the lung. This may be appropriate for example in a 
25 cystic fibrosis or emphysema model. Topical 

application to the skin is also possible and could be 
used in models of acne, hair loss, wound healing or 
viral infection. 

Induction of the gene of interest is accomplished 
30 by administering an effective amount of inducer, as 
described above. An effective amount of inducer may 
be construed to mean that amount which increases 
expression of the gene of interest by at least about 
50 percent. As the LD^ for tetracycline HC1 in rats 
35 is about 664 3 mg/kg and the therapeutic dose is 
between about 25-50 mg/kg, an effective dose of 
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untranslated DNA. This insert was excised from the 
parent plasmid and subcloned into a plasmid with a 
more suitable polylinker to produce pSTET7 . To this 
plasmid a 870bp Xhol, BamHI fragment derived from pMSG 
(Pharmacia) , containing the SV4 0 small-T intron and 
polyadenylation signal sequences was inserted at the 
Hindu site 3 f of the tetR coding region to produce 
pSTetRSv. Finally an unmodified 610bp PEPCK promoter 
was inserted at the EcoRl site of pSTETRSv to produce 
pPck tetRSv. The PEPCK promoter is identical to the 
promoter used to produce pPck_A, pPck_N, and pPckJT 
except that it does not contain a tetR operator site. 
This PEPCK promoter has been previously used in 
transgenic animals and is known to target gene 
expression specifically to the liver. 

6.1.3. GROWTH HORMONE GENES 
Plasmid pGH-SAF107 contains a 2.2kb BamHI , EcoRI 
genomic fragment of the bovine growth hormone (bGH) 
gene, blunt ligated into an EcoRV site. To this 
vector each of the modified PEPCK promoters was added 
by blunt ligating the promoter into the BamHI site of 
pGH-SAF107. The structure of the resulting plasmids 
is depicted in Figure 3. Plasmid pPCK_NbGH was 
deposited with the ATCC and assigned accession number 

'• For production of transgenic animals, 

each of the PEPCK— bGH genes was excised from the 
vector using Xhol and Sacl, gel fractionated and 
purified using an Elutip column. 

6.1.4. TRANSGENIC MICE 
Transgenic mice were made which contain both the 
Pck_tetRSv gene and one of the modified PEPCK 
promoters controlling bGH. Table 2 lists the number 
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sequence of the pddl operator site is shown in Figure 
IB. The 55 base pair tetR operator was excised from 
p9 d 7 as an EcoRl, AccI fragment, end filled with T4 
5 polymerase, and gel purified. This fragment was 
subsequently used to produce the modified PEPCK 
promoters Pck_N and Pck_A. 

Plasmids Pck_A and Pck_N were produced by 
inserting the 55bp tetR operator into the unique AccI 
10 and Nhel sites (respectively) of the PEPCK promoter 
(pPCK_NbGH has been deposited with ATTC and assigned 
accession No: ) . For both plasmids the promoter was 
cut with the appropriate restriction enzyme, end 
filled with T4 polymerase and the tetR operator blunt 
15 ligated into place* A third modified PEPCK promoter, 
Pck_T was produced in which the OP1 and OP2 operator 
sequences were positioned to flank the PEPCK TATA-box 
element. To produce Pck_T a new oligonucleotide 
(5 ■ ACTCTATCATTGATAGAGTTACTAT 
20 TTAAATCCCTATCAGTGATAGAGA3 • ) was produced. This 

oligonucleotide was kinased with T4 polynucleotide 
kinase and annealed to kinased X-2 and X-4 which are 
complementary to the first <Mid last 19bp. The 
complete double stranded 4 9 bp operator was produced by 
25 filling in the llbp linker region, which includes the 
PEPCK TATA-box element, with Klenow. The final 
product was then blunt cloned into an AccI, Nhel cut 
PEPCK promoter. All three modified promoters were 
sequenced to verify the inserts. Figure 2 depicts the 
30 structure of these promoters. 



6.1.2. REPRESSOR CONSTRUCT 
Plasmid pBI501 contains a 701 bp Hindi fragment 
from E. coli TnlO, cloned into the Hindi site of 
35 pUC8. The Hindi insert contains the entire tetR 

coding sequence along with 21bp of 5 1 and 55bp of 3 1 



BNSDOCID: <WO 9404672A1J_> 



WO 94/04672 



- 28 - 



PCT/US93/08230 



At 10 weeks of age, a sampling of transgenic 
female founders containing the A+T and N+T were tested 
for induction of bGH in the serum using a radio-immune 
assay, after a single IP injection of 60 mg/kg 
tetracycline-HCl. The purpose of this experiment was 
simply to determine which if either of these two 
modified promoters was responsive to repression by 
tetR. The results are summarized in Table 4* 



TABLE 4 



Construct 


Animal 


Weight 


Basal 


12 hours 


36 hours 


249 


2-5 female 


21.1 


0.00 


0.00 


0.00 


250 


6-6 female 


42.9 


4.6+0.033 


3.4+0.062 


4.9+0.072 


251 


6-6 female 


19.3 


0.00 


0.00 


O.OO 


251 


10-5 female 


25.1 


O. 20+0. 008 


O. 19+0. 001 


0.21+0.038 


252 


5-2 female 


38.7 


0.59+0. 107 


0.64+0.044 


1.12+0.207 


252 


5-3 female 


20.0 


0.00 


0.00 


0.00 


252 


10-2 


19.2 


0.00 


0.00 


0.00 



No induction of bGH was observed in animals that lack 
the Pck_tetRSV gene (construct 250) or in animals with 

25 both the Pck_AbGH + Pck-tetRSv genes (construct 251) . 
An approximate two fold increase in serum bGH levels 
was however detected in the 5-2 female which contains 
the Pck-NbGH + Pck_tetRSV genes. The remainder of the 
animals had undetectable levels of bGH expression, due 

30 in part to the relatively low sensitivity of this 
assay. For example the 10-2 female (construct 252) 
shows no detectable bGH in the serum, but subsequent 
experiments on her offspring indicate that this line 
of animals does express bGH mRNA in a tetracycline 

35 dependent manner. This initial data, suggested that 
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of eggs injected, offspring produced and number of 
transgenics derived for each construct. 



TABLE 2 



Construct 


Eggs 

injected 


Eggs 

transferred 


Live 
Born 


Transgenic 


Pck AbGH + Pck tetRSv 
(251) 


233 


194 


40 


14 (0.35) 


Pck-NbGH + Pck tetRSv 
(252) * 


268 


208 


30 


9(0.3) 


Pck TbGH + Pck-tetRSv 
(261) 


227 


197 


25 


5(0.2) 


6.2. 


RESULTS AND 


DISCUSSION 







Once the transgenic founder animals were 
identified, they were weighed each week. Table 3 
lists the mean weights of each group of transgenic 
animal at 11 weeks of age. 



TABLE 3 



Construct 


\. Sex 


Weight 


Pck 


AbGH + Pck_tetRSv(9) 


H 1 

» male 


36. 


122 (12.251) 


Pck_ 


AbGH + Pck_tetRSv(4) 


female 


29. 


125(7.861) 


Pck_ 


NbGH + Pck_tetRSv(5) 


male 


34. 


840(14.745) 


Pck_ 


NbGH + Pck_tetRSv(4) 


female 


28. 


125 (10.958) 


Pck_ 


TbGH + Pck_tetRSv(3) 


male 


36. 


267 (11.402) 


Pck_ 


TbGH + Pck_tetRSv(2) 


female 


27. 


300(5.798) 


NON- 


-TRANSGENIC (6) 


male 


29. 


583 (2.395) 


NON- 


•TRANSGENIC (6) 


female 


23. 


117 (1.863) 



As expected for each co-injection, large animals, 
35 obviously expressing elevated levels of bGH, were 
observed as were animals of normal stature. 
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An SI nuclease protection assay was performed to 
identify the start site of transcription of bGH mRNA. 
As shown in Figure 4, there was only one start site 
identified regardless of the presence or absence of 
tetR repressor binding. This start site was located 
approximately 20 bp downstream from the TATA-box. At 
this location , the message is initiating within the 
ddl operator sequence, just 3 or 4 base pairs 5 f of 
the first tetR binding site. 

7. EXAMPLE; OPTIMIZATION OF t etR CODING SEQUENCE 
The use of the wild type TnlO tetR gene in 
conjunction with the 252 construct indicates that the 
TetR system can function in transgenic animals and 
that in some cases, for instance in the 10-2 
transgenic animals, the level of regulation can be 
very high (FIGS. 9 A and 9B) . However, in other 
instances the efficiency of repression is not always 
complete, leading to a significant basal level of bGH 
expression. This failure to repress may be due to low 
level expression of tetR. To optimize the expression 
of tetR repressor, a synthetic tetR gene was generated 
which was devoid of splice signals and had optimized 
codon usage for mammalian cells. 



7.1 MATERIALS AND METHODS 

7.1.1. TISSUE SPECIFICITY AND TETRACYCLINE 
INTRODUCTION OF bG H IN LINE 1Q-2 

For all Northern blots 10/xg of whole RNA was 
electrophoreses through a 1% agarose gel containing 3% 
formaldehyde using standard techniques. To detect bGH 
mRNA a random primed, radioactive bGH cDNA probe was 
used. All conditions for hybridization and washing of 
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the Pck_N promoter was being regulated by tetR at 
least to a limited extent* 

To further characterize the mice, improve the 
5 sensitivity of the assay and to test the 

responsiveness of the Pck_T promoter, offspring of 
founder mice from each co-injection were produced. 
The transgenic progeny were then raised in the 
presence or absence of tetracycline medicated water 

xo (500/xg/ml) for 4 weeks, prior to analysis of bGH mRNA 
expression levels in the liver, the predominant site 
of PEPCK expression. Northern blot hybridization 
analysis of these animals (Figure 7) demonstrated 
again, that animals with the Pck_NbGH gene were 

25 responsive to repression by tetR and that the other 
two modified promoters exhibited no signs of tetR 
dependent regulation. 

We attempted to breed all of the remaining 
founders containing the Pck-NgGH + Pck_tetRSv genes to 

20 analyze their offspring in a similar manner (Figure 

8) . Of the 5 founders which produced offspring, 2 did 
not express bGH under any conditions, and from the 
remaining 3 one segregated t^wo different integration 
sites allowing us to establish a total of 4 lines. 

25 All 4 lines exhibited tetracycline dependent bGH 

expression as assayed by Northern blot hybridization. 
The efficiency of tetR repression appeared to be 
inversely correlated with the level of expression. 
For example 9-5 animals have the highest level of bGH 

30 expression, show an obvious increase in body size, and 
exhibit only marginal tetR repression. In contrast 9- 
4Lc and 10-2 animals exhibit lower levels of 
tetracycline induced bGH expression, are of normal 
stature and appear to be efficiently regulated by 

35 tetR. 
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produce smaller than expected products (approx. 215bp 
vs. 619bp for TZ-4 and TZ-1, and approx* 94bp vs. 
498bp for TZ-4 and TZ-3) . The products of this 
5 reaction were cloned and sequenced. The sequence 
revealed the presence of an unexpected intron which 
spanned from near the Xbal site at the start of tetR 
to a splice acceptor just 8 base pairs 5" of the TZ-3 
primer . 

10 

7.1.4. 34 5 REPRESSOR CONSTRUCT 

In an embodiment of the invention, any nuclear 
localization signal may be added to a natural or 
synthetic tetR gene to facilitate its expression. For 

15 example, complementary oligonucleotides which encode a 
nuclear localization signal sequence were synthesized 
(Oligos etc.) and added in frame to the tetR coding 
sequences of pSTETR107 at the EcoRl and Xbal 
restriction sites to produce pNTETR. Oligonucleotide 

2 0 sequences are : 

( GB 1 ) 5 • AATTCGCCAGCCATGCCAAAAAAGAAGAGGAAGGTAT3 • and 
(GB2 ) 5 1 CTAGATACCTTCCTCTTCl^rTTTTGGCATGGCTGGC3 1 . 
When annealed these oligonucleotides have a 5 1 EcoRl 
and 3 1 Xbal compatible overhangs. These 

25 oligonucleotides fuse the amino acid sequence Met Pro 
Lys Lys Lys Arg, Lys Val,to the third amino acid (Arg) 
of wild type tetR. 

Two complementary 51 base pair oligonucleotides 
which start the 5' cap site of bGH and extend to the 

30 first exon were synthesized (Oligos etc.). Sequence 
for the oligonucleotides are (5b-l) : 

5 • GATCCCAGGACCCAGTTCACCAGACGACTCAGGGTCCTGTGGACAGCT 
CAG3 • 

and (5b-2) : 

35 5 • AATTCTGAGCTGTCCACAGG ACCCTGAGTCGTCTGGTGAACTGGGTCC 
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filters were done in accordance with standard 
techniques of molecular biology. 

5 7. 1.2. EXPRESSION AND ALTERNATIVE PROCESSING 

OF THE tetR TRANSGENE 

A RNase protection probe which extended from the 
Nrul site of tetR 3« to the end of the gene was used. 
This probe includes only tetR coding sequences and 

10 should give a fully protected fragment of 

approximately 400 base pair. When hybridized to 150/ug 
of liver RNA (500,000 cpm of probe in a 30/il 
hybridization consisting of 80% formamide, 4 0mM PIPES 
pH 6.4, 4 00mM NaOAc , and ImM EDTA) , and digested with 

xs RNase one (Pr omega) for 30 minutes at 37° as 

recommended by the manufacturer, a protected fragment 
of approximately 221-260 base pairs is observed, far 
smaller- than predicted. 

20 7.1.3. 5' STRUCTURE OF tetR mRNA 

Liver RNA was treated with reverse transcriptase 
and amplified by PCR using the manufacturers 
recommended conditions (Pharmacia) . The RNA was 
amplified using two different pairs of primers. The 

25 first primer pair (TZ-1 and TZ-4) should produce a 619 
base pair product. The second primer pair (TZ-3 and 
TZ-4) should produce a 498 base pair product. The 
sequence of the primers are: 
TZ-1: 5 • CCGCATATGATCAATTCAAGGCCGAATAAG 3 1 

3 o TZ-3 : 5 • CTTTAGCGACTTGATGCTCTTGATCTTCCA3 • 
TZ-4 : 5 1 AATTCGCCAGCCATGCCAAAAAAGAAGAGG3 1 

The TZ-4 primer is common to both primer pairs 
and is the 5 1 primer which encompasses the start codon 
of the tetR mRNA. Primer TZ-1 and TZ-3 are two 

35 different 3 1 primers both of which are in the tetR 
coding region. When amplified, these primer pairs 
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Sail and Sacl. This fragment was gel purified and 
coinjected with the PCK-NbGH gene previously described 
to generate transgenic mice. 

5 

7.1.5. SYNTHETIC tetR COMPONENT SEQUENCES 
The components of the synthetic tetR gene were 
synthesized by Midland Laboratories as four 
overlapping double stranded DNA cassettes. The 

10 sequence of these cassettes are shown in Figure 15. 

Each cassette was blunt cloned into the Hinc2 site of 
pUC19 and sequenced to verify authenticity. The 
resulting plasmids pLTl, pLT2 , pLT3 and pLT5 can be 
used as the source material to assemble the entire 

15 synthetic tetR coding sequence since each contains an 
overlapping unique restriction site (bold face) 
through which they can be joined (pLT-1, pLT-2, pLT-3 
and pLT-5 have been deposited with ATCC and have been 
assigned accession numbers , , , and 

20 respectively) . There are many possible ways by which 
these cassettes can be joined. By way of an example, 
the inserts of plasmid pLTl^and pLT2 can be excised 
using EcoRl and Nsil. The ^nserts can then be 
combined by cloning these two fragments into an EcoRl 

25 vector. This procedure will assemble the 5' half of 
the gene, using the overlapping Nsil restriction site 
to join the pieces. Similarly, the 3 1 half of the 
gene can be assembled from pLT3 and pLT5 by cutting 
with EcoRl and Sphl (pLT3) and Sphl and Hind III 

3Q (pLT5) to release the inserts. These inserts can then 
be joined at the overlapping Sphl site by cloning the 
fragments into an EcoRl, Hind III cut vector. 
Finally, the entire coding region can be put together 
using the overlapping restriction site ApaLl. This 

35 would result in a vector with the synthetic tetR 
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TGG3 ' . When annealed these oligonucleotides have 5 ■ 
BamHl and 3 9 EcoRl compatible overhands . The 
oligonucleotides for the 5* leader sequence of bGH 
5 were cloned into a BamHl, EcoRl cut plasmid to produce 
p5'GH. 

The nuclear localization modified tetR coding 
sequence was isolated by gel purification after 
restriction digestion of pNTETR using EcoRl and Hind 

10 III* This fragment was then inserted into p5'GH at 
the EcoRl and Hind III sites to product pS'GHTR. 

To add the remainder of the bGH genomic sequence 
an intermediate modification of p5 9 GHTR was first 
made. This modification consisted of adding a 

15 Hind III - Pstl linker to the Hind III site of p5 9 GHTR 
to product pGTO. The sequence of the oligonucleotides 
which comprise this linker are: (CC-1) 
5 9 AGCTTCTGCAG3 9 and ( CC-2 ) 5 1 AGCTCTGCAGA3 9 . The 
remaining bGH genomic sequences were added in two 

20 steps. First the Pstl Sac2 fragment that begins in 

the first exon of bGH and ends in the third intron was 
excised from pSGH107. Similarly, the insert of pGTO 
which contains the 5 1 untranslated leader of bGH and 
the nuclear localization modified tetR was excised 

25 using BamHl and Pstl. These two gel purified 

fragments was then cloned into a BamHl Sac2 cut vector 
to produce pGTG. Finally, the remainder of the bGH 
gene from the Sac2 site in the third intron to the end 
of the gene, was added to pGTG by cutting pGTG with 

30 Sac2 and adding the Sac2 fragment from pSGH106 to 
produce pNTETR-GH. 

Plasmid pNTETR-GH was digested with BamHl to 
excise the NTETR-GH gene. The fragment was cloned 
into the BamHl site of pPCK 305 to produce the final 

35 plasmid pPCK-GHNTET. To produce transgenic mice, the 
PEPCK-GHTET gene was excised from the plasmid using 
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tetR gene is vastly improved. The graph depicts the 
results when the synthetic tetR coding sequence is 
compared to the same mouse codon bias table used 
5 previously. 

7.2 RESULTS 

7.2.1- EXPRESSION OF tetR IN CONST RUCT 345 OFFSPRING 
To improve tetR expression a new repressor 

10 construct was produced. The construct, referred to as 
Construct 345 is depicted in Figure 10. In the 345 
construct the coding region of tetR is augmented with 
a nuclear localization signal sequence to increase the 
nuclear concentration of repressor. The tetR coding 

15 region was inserted into the first exon of the bGH 
gene. The bGH gene then acts as a genomic carrier , 
providing multiple introns, which may improve 
expression, and a strong polyadenylation signal, which 
may improve the processing and stability of the 

2Q message. 

The new repressor was coinjected with the bGH 
gene from construct 252. The resulting transgenic 
animals contain the new repressor, and a PEPCK 
regulated bGH gene with the tetR operators located 

25 just 3» of the PEPCK TATA-box element. Offspring of 
these animals were screened for bGH induction (FIG. 
11). Of the lines tested only one, line 14, showed 
tetracycline dependent regulation of bGH, and in this 
one case there was still a significant base level of 

30 bGH expression. Northern analysis, performed to 
determine the levels of tetR mRNA expressed in the 
transgenic mice, indicated that the tetR gene was 
still not expressed at a high level. 

To detect tetR mRNA with higher sensitivity the 

35 tetR mRNA was analyzed using RNase protection. This 
technique revealed that the mRNA was shorter then 
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coding sequence, as depicted in Figure 16, cloned into 
a plasmid as an EcoRl Hind III fragment. 

5 7.1.6. COMPOSITIONAL ANALYSIS OF 

WILD TYPE TnlO tetR GENE 

The TnlO tetR coding sequence was analyzed on a 
desktop computer using Mac Vector software* Figure 14 
shows a diagram of the tetR coding region with all of 

10 the plus strand splice doner (D) and splice acceptor 
(A) signal sequences indicated. For reference the 
location of the Xbal restriction is also indicated. 
The first graph depicts the percentage of G and C 
bases in the coding region of tetR. There are several 

15 domains of very low GC content. The bottom graph is 
an analysis of codon bias. The dark line is a 
comparison of the tetR codon usage to a mouse codon 
bias table. Values much lower than 1.0 are indicative 
of sequences which may translate poorly. For 

2o reference, a comparison of tetR to a Tobacco codon 
bias table is included (light line) . In transgenic 
tobacco, the tetR regulation* system functions very 
efficiently, suggesting that for this gene, codon bias 
may be an important factor fbr efficient expression. 

25 

7.1.7. COMPOSITIONAL ANALYSIS OF SYNTHETIC tetR 
Figure 17 depicts the structure of the synthetic 
tetR gene, now devoid of splice donor signal 
sequences, with only a single splice acceptor signal 

30 remaining (A) . This is not the splice acceptor which 
was active in the 345 construct. The percentage of G 
and c bases has been significantly improved, while the 
frequency of CpG base pairs has been kept to a 
minimum. A CpG base pair is frequently the site for 

35 DNA methylation, which can negatively effect the 

expression of a gene. The codon bias of the synthetic 
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greatly improved codon usage for mammalian cells. The 
sequence of the of the syn-tetR is indicated in Figure 
16. The predicted analysis for splicing signals, G+C 
5 content, and codon usage are depicted in Figure 17. 

8. DEPOSIT OF MICROORGANISMS 
The following microorganisms have been deposited 
with the American Type Culture Collection , (ATCC) , 
10 Rockville, Maryland and have been assigned the 
following accession numbers: 

Microorganism Date of Deposit Accession No. 

pI/T-1 August 25, 1993 

pLT-2 August 25, 1993 

15 pLT-3 August 25, 1993 

pLT-5 August 25, 1993 

pPCK_NbGH August 25, 1993 

The present invention is not to be limited in 
scope by the microorganisms deposited since the 

2 0 deposited embodiments are intended as illustrations of 
single aspects of the invention and any microorganisms 
which are functionally equivalent are within the scope 
of the invention. \ 

The present invention is not to be limited in 

25 scope by the exemplified embodiments which are 

intended as illustrations of single aspects of the 
invention, and any clones, DNA or amino acid sequences 
which are functionally equivalent are within the scope 
of the invention. Indeed, various modifications of 

30 the invention in addition to those described herein 
will become apparent to those skilled in the art from 
the foregoing description and accompanying drawings • 
Such modifications are intended to fall within the 
scope of the appended claims. 

35 
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expected (FIG. 12) . Subsequent analysis using reverse 
transcriptase-PCR with primers that amplify the entire 
coding region of tetR confirmed that the mRNA was 
5 significantly shorter then expected (FIG. 13) . 

Sequence analysis of these RT-PCR products indicated 
that an unexpected splicing event had occurred. This 
splicing process occurred between a splice donor 
signal in the 5' end of the tetR coding region and a 
10 splice acceptor approximately 400 bp 3 1 of the start 

codon. The resulting mRNA is therefore deleted of the 
tetR DNA binding domain and about two third of the 
entire coding region. This mRNA could not possibly 
make a functional repressor. 

15 

7.2.2. OPTIMIZATION OF tetR CONSTRUCT 
A more detailed analysis of the tetR coding 
sequence indicated that the codons used in this gene 
are poorly suited for expression in mammalian cells 

2o (FIG. 14). Therefore, it appears that the 

inefficiency of the tetR system is the result of two 
processes: (i) aberrant splicing of the RNA to 
produce a nonfunctional message; and (ii) inefficient 
translation which can lead to rapid mRNA turnover. 

25 To circumvent the problems of internal splicing 

and potential problems due to codon bias and G-C 
content, a synthetic tetR gene was designed. The 
components of the synthetic tetR gene were synthesized 
as four overlapping double stranded cassettes. Each 

30 cassette was cloned in puc!9. The resulting plasmids 
designated pLT-1, pLT-2, pLT-3 and pLT-5, as depicted 
in Figure 15, have been deposited with ATCC and 

assigned accession numbers , , , and 

/ respectively. The synthetic tetR (syn-tetR) 

35 has been designed to encode exactly the same protein 
product, but is devoid of splice signals and has 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Byrne, Guerard 

<ii) TITLE OF INVENTION: TETRACYCLINE REPRESSOR— MEDIATED BINARY 
REGULATION SYSTEM FOR CONTROL OF GENE EXPRESSION IN 
TRANSGENIC ANIMALS 

(iii) NUMBER OF SEQUENCES: 15 

<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEMS PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1-25 

(vi) CURRENT APPLICATION DATA: 

<A) APPLICATION NUMBER: US 07/935,763 

(B ) FILING DATE: 26-AUG-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Coruzzi, Laura A. 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE /DOCKET NUMBER: 6794-025 

(ix) TELECOMMUNICATION INFORMATION: \ 

(A) TELEPHONE: 212 790-9090 \ 

(B) TELEFAX: 212 869-8864/9741 \ 

(C) TELEX: 66141 PENNIE '< 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: Is 
TTGACACTCT ATCATTGATA GAG TTATTTT AOCACTCCCT ATCAGTGATA GAGAAAAGT 59 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 
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It is also -bo be understood that all base pair 
sizes given for nucleotides are approximate and are 
used for purposes of description. 
5 Various publications are cited herein, which are 

hereby incorporated by reference in their entirety. 
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Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA AAA AAT AAG 144 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 
Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 240 
Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 - 70 75 80 

AAT AAC GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC GAT GGA 288 
Asn Asn Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 384 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 432 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 480 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 528 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 i 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA \TTC GGC CTT GAA TTG 576 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCT TAA 624 
lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 

<2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Ser Arg Leu Asp Lvs Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
1 5 10 15 

Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr A-c Lys Leu Ala Gin 
20 * 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
GAATTCGATA CTCTATCATT GATAGAGTAT CAAGCTTATC CCTATCAGTG ATAGAGATAC 60 
CGTCGACCTC 70 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ACTCTATCAT TGATAGAGTT ACTATTTAAA TCCCTATCAG TGATAGAGA 49 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 

GGAATTCGAT ACTCTATCAT TGATAGAGTA TCAAGCTTAT CCCTATCAGT GATAGAGATA 60 
CCGTCGACCT C 7 2 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

( ix ) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .624 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGO GCA TTA GAG CTG 4 6 

Met Ser Arg Leu Asp Lys Ser Lys Val He Asn S jx Ala Leu Glu Leu 
1 5 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 9 6 
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(2) INFORMATION FOR SEQ ID NO: 9 r 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTATTATGTT TTATGTTACT GTAAAAGATG TAAAGAGAGG CACGTGGTTA AGCTCTCGGG 
GTGTGGACTC CACC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGCCCCAAGC ATAAACCCTG GCGCGCTCGC GGCCCGGCAC TCTTCTGGTC CCCACAGACT 60 
CAGAGAGAAC CCA 73 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 74 base pairs y 

(B) TYPE: nucleic acid » 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: II: 
TAGGCAGCAG GCATATGGGA TGGGATATAA AGGGGCTGGA G C ACTG AG AG CTGTCAGAGA 
TTTCTCCAAC CCAG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(Al LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

Asn Asn Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 1S5 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

He He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH : 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown i 

(ii) MOLECULE TYPE: DNA (genomic) \ 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CGGCCCTATA AAAAGCGAAG CGCGCGGCGG GCGGGAGTCG CTGCGTTGCC TTCGCCCCGT 60 
GCCCCGCTCC GCGCCGCCTC GCGCCGCCCG CC 92 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AAGAAGTATA TTAGAGCGAG TCTTTCTGCA CACACGATCA COTTTCCTAT CAACCCCACT 60 
A 61 
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Intemational Application No: PCTV / 

MICROORGANISMS 

Optional Shoot in connection with tho microorganism referred to on page 38, tines 7-23 of the description ' 

A. IDENTIFICATION OF DEPOSIT » 

Further deposits are identified on on additional ohoet ' 

Name of depositary institution * 
American Type Culture Collection 



Address of depositary institution (including postal code and country) 4 

12301 Parklawn Drive 
RockvUle, MD 10582 
US 



Date of deposit * August 25, 1993 Accession Number * N/A 

B. ADDITIONAL INDICATIONS » (leave bUnk if not eppBcebk). ThU jefcroiion b continued on a gwte attached ahcet B 



C. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE * nra.ii 



D. SEPARATE FURNISHING OF INDICATIONS * fere 4mk if a* applicable) 



* (leave j|ank i 



The tndjcatjom listed below will be submitted to the International Bureau later • (Specify the peneral nature of the indications e.g.. 
"Accession Number of Deposit") 



I Bureau I 



E. CJ| This sheet was received with the International application when filed (to be checked by the receiving Office) 

(Authorised Officer) J 
□ The date of receipt (from the applicant) by the International Bureau ■ 



(Authorized Officer) 



Form PCT/RO/134 (January 1981) 
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ACTCTATCAT TGATAGAGT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
ACTCTATCAA TGATAGAGT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
TCCCTATCAG TGATAGAGA 
(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TCTCTATCAC TGATAGGGA 
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25 



30 



WHAT IS CLAIMED IS: 

1. A substantially purified and isolated 
nucleic acid molecule comprising an animal promoter 
element that comprises a tetR operator sequence. 

2. The nucleic acid molecule of claim 1 in 
which the tetR operator sequence is positioned 3 1 to a 
TATA-box sequence. 

3. The nucleic acid molecule of claim 1 in 
which the promoter element is the PEPCK promoter. 

4. The nucleic acid molecule of claim 3 in 
which the tetR operator sequence has been inserted 
into the Nhel site of the PEPCK promoter element. 

5. The nucleic acid molecule of claim 1, 2, 3 
or 4 in which the promoter element controls the 
expression of a gene of interest. 

6. The nucleic acid molecule of claim 5 in 
which the gene of interest i\s bovine growth hormone. 



7. A non-human transgenic animal that carries, 
as a transgene, the nucleic acid molecule of claim 1, 
2, 3 or 4. 

8. A non-human transgenic animal that carries, 
as a transgene, the nucleic acid molecule of claim 5. 

9. A non-human transgenic animal that carries, 
as a transgene, the nucleic acid molecule of claim 6. 
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17 . The transgenic animal of claim 15 that is a 

pig- 

18. A substantially purified and isolated 
nucleic acid molecule comprising an optimized tetR 
gene as depicted in Figure 16. 

19. The non-human transgenic animal of claim 7 
that further carries an optimized transgene encoding 
the tetR repressor protein and having a sequence as 
depicted in Figure 16. 

20. The non-human transgenic animal of claim 8 
15 that further carries an optimized transgene encoding 

the tetR repressor protein and having a sequence as 
depicted in Figure 16. 

21. The non-human transgenic animal of claim 9 
2 0 that further carries an optimized transgene encoding 

the tetR repressor protein and having a sequence as 
depicted in Figure 16. 



30 



22. A non-human transgenic animal that carries 
25 an optimized transgene encoding the tetR repressor 
protein and having a sequence as depicted in Figure 
16 . 



23. A method of selectively inducing the 
expression of a gene of interest in a non-human 
transgenic animal comprising administering a 
tetracycline compound to a non-human transgenic animal 
that carries a first transgene which is a gene of 
interest under the control of a promoter element 
35 modified to comprise a tetR operator sequence and a 

second optimized transgene encoding the tetR repressor 
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10. The non-human transgenic animal of claim 7 
that: further carries a transgene encoding the tetR 
repressor protein, 

5 

11. The non-human transgenic animal of claim 8 
that further carries a transgene encoding the tetR 
repressor protein. 

10 12 • The non-human transgenic animal of claim 9 

that further carries a transgene encoding the tetR 
repressor protein. 

13. A non-human transgenic animal that carries a 
15 transgene encoding the tetR repressor protein. 

14 . A method of selectively inducing the 
expression of a gene of interest in a non-human 
transgenic animal comprising administering a 

2o tetracycline compound to a non-human transgenic animal 
that carries a first transgene which is a gene of 
interest under the control of a promoter element 
modified to comprise a tetR \ operator sequence and a 
second transgene encoding the tetR repressor protein. 

25 

15. A non-human transgenic animal that carries 
(i) a first transgene that encodes bovine growth 
hormone and is under the control of PEPCK promoter 
element modified to contain a tetR operator at the 

30 Nhel site; and (ii) a second transgene that encodes 
tetR repressor protein. 

16. The transgenic animal of claim 15 that is a 
mouse . 

35 
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protein and having a sequence as depicted in Figure 
16. 

5 24 . A non-human transgenic animal that carries 

(i) a first transgene that encodes bovine growth 
hormone and is under the control of PEPCK promoter 
element modified to contain a tetR operator at the 
Nhel site; and (ii) a second optimized transgene that 
10 encodes tetR repressor protein that has a sequence as 
depicted in Figure 16. 

25. The transgenic animal of claim 24 that is a 
mouse . 

15 

26. The transgenic animal of claim 24 that is a 

pig. 



20 

i 

\ 

i 

25 



30 



35 
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MODIFIED PEPCK PROMOTERS 



251 (5' TATA) 




252 (3* TATA) 




261 (Flank TATA) 




BNSDOCID: <WO 9404672A1_»_> 



WO 94/04672 



2/ 19 



PCI7US93/08230 



EcoRl OP1 unk^ 

ggaattcgat-ACT CTA TCA TTG ATA GAG TATCAA fifTTAT C CC 

OP2 AccI 

TAT CAG TGA TAG AGA-taccgtcgacctc 



f~~) G-O >ee 16. 
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S\ Nuclease Protection: 5' Start Site 

» 252:9-5 252:9-4 252:9-4 

§ He Lc 
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320 



330 



340 



350 



360 
* 



ACA GAA AAA CAG TAT GAA ACT CTC GAA AAT CAA TTA GCC TTT TTA 
TEK QYETLENQLAFL> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; C0DGN_START=1 > 

b b b TETR REPRESSOR MRNA ( SPLIT ]_b 1> 1> P > 

<1210_a_a___1200_903 TO 1526 OF TRN10TETR_0_a a 1170a 



370 



380 



390 



400 



TGC CAA CAA GOT TTT TCA CTA GAG AAT GCA TTA TAT <^ <^ AGC 

C TETRACYCLINE REPRESSOR^PROTEIN (TETR) ; CCDCW_START=1_ > 

K y> b T ETR REPRESSOR MRNA [SPLIT ]_b P- — » P > 

< a~1160__a a_503 TO 1526 OF TRN10TETR_a_1130 — a a 



440 



450 



410 420 430 

GCT GTG GGG CAT TTT ACT TTA GOT TGC TTG GAA GAT CAA GAG 

A TETRACYCLINE REPRESSOR^RDTEIN (TJ^^ COTX*^ART=l_ 
> fs * T ETR REPRESSOR MRNA^ [SPLXT]_fr b_ b- 



r-v r> i&iA * — - - — 

!^T a a 1 110-903 TO 1526 OF TRN10TETR_0_2 



460 



470 



480 

* 



1080 



490 




CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA CCT ACT ACT GAT ACT 

H TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODDN^TART=l^ > 

K b b TETR REPRESSOR MRNA [ SPLIT] _b fe b b - > 

< a_1070 — « a 903 TO 1526 Of\ TRN10TETR_a_1040 — A. 



500 



510 



520 



530 



S40 



ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA TTT GAT CAC CAA 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; COD0N^START= 1 > 

b b b T ETR REPRESSOR MRNA [SPLIT] _fc> P £>— f> > 

<10l0a 1020^903 TO 1526 OF TRNl0TETR^0_a a a990a 



550 



560 



570 



580 



GCT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG ATC ATA TGC 

G TETRACYCLINE REPReLoR^ROTEIN ^JU COCO^^l * 

b Jo. b TETR REPRESSOR MRNA C SPLIT] _b b P « 

< a~38 0Za_&-.90 3 TO 1526 OF TRN10TETB — a — 950 — e 



590 



600 



610 



620 



GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCT TAA 
CLEKQLKCESGS > 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODON > 

b b "*ETR REPRESSOR MRNA [Sf -,IT)_j2 b b > 

^qlfT a a * 903 TO 1526 OF TRN10TETF S10_a a 
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10 



20 



30 



40 



ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG 
MSRLDKSKVINSALE> 

T ETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODQN_JSTART=l > 

b b b T ETR REPRESSOR MRNA [SPLIT] __£>___£> b b. > 

< a_1520_cU__a_9O3 TO 1526 OF TRN10TETR — a_1490 — « 



50 



60 
* 



70 



80 

★ 



90 



CTG CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC 
LL,NEVGIEGLTTRKL> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODON_START=l > 

~~"b b b T ETR REPRESSOR MRNA [ SPLIT b b *> > 

^148 0 a a 1 470 903 TO 1526 OF TRN10TETR_0_a a 1440c* 



100 



110 



120 



130 



GCC CAG AAG CTA GGT GTA GAG CAG OCT ACA TTG TAT TGG CAT GTA 
AQKLGVEQPT LYWH V> 

__TETRACYCIJCNE REPRESSOR PROTEIN (TETR) ; CODO*t_START=l. > 

_ b b b T ETR REPRESSOR MRNA [ SPLIT ]_b b 1> fcL 

< a_1430 a_a_9Q3 TO 1526 OF TRN10TETR_a_1400_ 



140 



150 



160 



170 




AAA AAT AAG CGG GOT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA 
KNKRALLDALAIE M L> 

_TETRACYCIiINE REPRESSOR PROTEIN (TETR) ; COD0fcl_START=l > 

b b b T ETR REPRESSOR MRNA [ SPLI T )_JD b *> fc> > 

<1390_a__a__J.380_903 TO 1526 OF TF^l 0TETR_0__a a 1350a 

190 200 \ 210 220 

GAT AGG CAC CAT ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG 
DRHHTHFCPLBGESW> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); CODON_START=l > 

b b b T ETR REPRESSOR MRNA [SPLIT] Jp — _fc> k k > 

< a 1340 a a 903 TO 1526 OF TRN10TETR a_1310 — a a 



230 



240 



250 



260 



270 



CAA GAT TTT TTA CGT AAT AAC GCT AAA AGT TTT AGA TGT GCT TTA 

qdflrnnaks FRCA l> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODON_START=1 > 

b b . b TETR REPRESSOR MRNA (SPLIT] _£> fc> > 

<1300_a a 1290^903 TO 1526 OF TRN10TETR_0_a a 1260a 



280 



290 



300 



310 



CTA AGT CAT CGC GAT GGA GCA AAA GTA CAT TTA GGT ACA CGG CCT 
LSHRDGAKVH LjGT r p> 

TETRACYCLINE REPRESSOR PROTEIN (TETR) ; CODON_START=3 ; 5 

b _ _ b b TETR REPRESSOR MRNA \ SPLIT] _b b , b b - 

< a_l250 a a 903 TO 1526 OF TRN10TETP; a_1220 a a 
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INDUCTION OF BOVINE GROWTH HORMONE 
mRNA BY TETRACYCLINE 



CONSTRUCT 
ANTIBIOTIC 
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Tissue Specificity, and Tetracycline Induction 
of bGH in Line 10-2 
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Induction ofbGHin Construct 345 Offspring 



FOUNDER 32 33 31 14 19 63 63 44 54 

TETRACYCLINE _ + - + 1 + + | - -f -+-+-+-+- + 
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345 Repressor Construct 



Xba 



Xba Nru TAA 

J L 



PEPCK NtetR 



bGH 



PolyA 
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FIGURE S3 



5' Structure of tetR mRNA 



202 203 210 301 304 




3' primer g g g g g g ; ^ g £ g 
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Expression and Alternative 
Processing of the tetR Transgene 




Protected fragments 



406 
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Synthetic tetR Component Sequences 

LT1 

EcoR5 and EcoRl 

gatatcgaattcatgagtagattc^ 

TCTGGAGCimTCAATGAAGTCKKSCAT^ 

TGGCCCAGAAGCTGGGAGTGGAGC^GCCAACATTGTACTG 

AATAAGAGGGCTCrGCTGGATGCATTGGCGGTACCAGGC 

Nsil Kpnl 

LT-2 

Kpnl Ndi 



GCTCGGTAGCTXKJATCCATTGGCCATTCAGATGCTCGAC^ 

acacttctgccx1actggaaggcgagagttggcagg 

atgctaagagtttcagatgtgctctgtig 

gtgcac ct ggaattc gagc 

ApaLl EcoRl 
LT-3 

EcoRl ApaLl 
GCTCGAATTCAAAGTGCACCTGGGTACAAGGC^ 

agaccctggagaaccagctggcatttctgtc^ 
gagaatgcattgtatgctctgagtgctgtggcrrc 

TCTCCTGGACHjACCAGGAGCACCAGGTGGCCAAGGAC^ 

CAACCACTGACAGCATGCCCCGGATCCGAGC 
Sphl BanHl 

LT-5 

BaniHl Sphl 

GCTCGGATCGACAGCATGCCCCCATTGCnXjAGACAGGCCTATGAGCTXjTT 
TGACCACCAAGGGGCAGAGCCTGClXTTCTCriTT 

ICTGTGGTCTGGAGAAGCAGCTGAAGTGTGAGaGTGGCTCCT GAAGCTTG 
ATATC Hind3/EcoR5 
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Composition Analysis of Wild Type TnlO tetR Gene. 
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Codon Window Size = 25 



Codon Bias File: Mouse codon bias 
Genetic^Code: universal 
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Compositional analysis of Synthetic tetR 
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Genetic vpode: universal 









r 



















































BNSDOCID: <WO 9404672A1 I > 



WO 94/04672 



18/ 19 



PCT/US93/08230 



Sequence of Synthetic tetR Gene. 



GATATCGAAT TCATGAGTAG 

TCAATAGTGC TCTGGAGCTG 

AGGTCTGACT ACCAGAAAGC 

GAGCAGCCAA CATTGTACTG 

CTCTGCTGGA TGCATTGGCC 

CCATACACAC TTCTGCCCAC 

GACTTCCTGA GGAACAATGC 

TGTTGAGCCA CAGAGACGGT 

AAGGCCAACA GAGAAACAGT 

CTGGCATTTC TGTGCCAACA 

CATTGTATGC TCTGAGTGCT 

TTGTGTCCTG GAGGACCAGG 

GAGAGGGAGA CCCCAACCAC 

TGAGACAGGC CATAGAGCTG 

GCCTGCTTTT CTGTTTGGCC 

CTGGAGAAGC AGCTGAAGTG 
TGATATC 



ATTGGACAAG AGCAAAGTGA 

TTGAATGAAG TGGGCATAGA 

TGGCCCAGAA GCTGGGAGTG 

GCATGTGAAG AATAAGAGGG 

ATTGAGATGC TGGACAGACA 

TGGAAGGCGA GAGTTGGCAG 

TAAGAGTTTC AGATGTGCTC 

GCTAAAGTGC ACCTGGGTAC 

ACGAGACCCT GGAGAACCAG 

AGGCTTCAGC CTGGAGAATG 

GTGGGTCACT TCACACTGGG 

AGCACCAGGT GGCCAAGGAG 

TGA<5AGCATG CCCCCATTGC 

TTTGACCACC AAGGGGCAGA 

TGGAiGCTCAT CATCTGTGGT 

TGAGAGTGGC TCCTGAAGCT 
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